Search AI Products and News
Explore worldwide AI information, discover new AI opportunities
- ✓AI News
- AI Tools
2025-07-18 09:57:18.AIbase.
5.63% Error Rate Sets New Low: NVIDIA AI Launches Commercial-Grade Ultra-High-Speed Speech Recognition Model Canary-Qwen-2.5B
2025-07-14 09:36:45.AIbase.
The Ultimate TTS Tool for Films! IndexTTS2 Zero-Shot Cloning + Emotion Control A Revolutionary Breakthrough in Dubbing!
2025-07-07 17:36:29.AIbase.
Stream-Omni: Supports Various Modalities Combination Interaction, Opening the Era of Text, Vision, and Speech Integration
2025-07-04 11:13:59.AIbase.
Open Source Revolution! Kyutai TTS Launches: Ultra-Low Latency Speech Synthesis, the New Era of AI Voice is Here!
2025-07-04 09:48:54.AIbase.
Kyutai Labs abre el código de Kyutai TTS: tecnología de síntesis de voz en tiempo real con bajo latencia
2025-07-02 16:19:47.AIbase.
Open Source End-to-End Speech Large Model Step-Audio-AQAA: Understand Audio and Generate Natural Speech Directly
2025-07-01 14:07:56.AIbase.
TEN VAD Shocks Open Source: Enterprise-Level Speech Detection Tool, Creating a Super Intelligent AI Voice Assistant!
2025-07-01 11:25:55.AIbase.
TEN Agent Open Source TEN VAD and Turn Detection Enable Ultra-Low Latency for Speech AI
2025-07-01 11:01:49.AIbase.
Qwen-TTS Launches with Major Breakthrough in Dialect Speech Synthesis, Realism Comparable to Human Voices
2025-07-01 08:42:27.AIbase.
New Release of Qwen-TTS Adds Support for Three Chinese Dialects
2025-06-30 14:54:35.AIbase.
New Open Source AI System OmniGen 2: Integrates Image and Text Generation Like GPT-4o
2025-06-30 09:27:58.AIbase.
Runway AI Launches Its New Game World: A Large Interactive Text Adventure
2025-06-26 14:25:53.AIbase.
Google Launches Imagen4: Breaking the Text-to-Image Generation Bottleneck, Gemini API Empowers Text-to-Image
2025-06-25 08:48:03.AIbase.
ElevenLabs Launches Mobile App Free Users Get 10 Minutes of Text-to-Speech Credit
2025-06-24 10:01:19.AIbase.
From Text Generation to Instruction Editing: OmniGen2 Redefines Application Scenarios for Open-Source Multimodal Models
2025-06-19 16:11:19.AIbase.
Tongyi APP Upgrades Translation Capabilities to Create the Strongest Translation Complex
2025-06-18 15:55:26.AIbase.
Apple's New Speech Technology Takes the Field! 34-Minute 4K Video Transcription Completed in Only 45 Seconds, Speed Exceeds OpenAI by 55%
2025-06-18 11:09:50.AIbase.
Apple's new Speech API transcribes at an impressive speed, surpassing OpenAI Whisper by 55%
2025-06-17 11:17:00.AIbase.
Comprehensive Review of UntitledPen: Full Analysis of an AI Voice Generation Tool - How to Create Natural Voice Content
2025-06-17 11:08:40.AIbase.